Noise Reduction by Maximum a Posteriori Spectral Amplitude Estimation with Supergaussian Speech Modeling

نویسندگان

  • Thomas Lotter
  • Peter Vary
چکیده

ESTIMATION WITH SUPERGAUSSIAN SPEECH MODELING Thomas Lotter and Peter Vary Institute of Communication Systems and Data Processing ( ) Aachen University (RWTH), Templergraben 55, D-52056 Aachen, Germany E-mail: lotter vary @ind.rwth-aachen.de ABSTRACT This contribution presents a spectral amplitude estimator for acoustical background noise suppression based on maximum a posteriori estimation and supergaussian statistical modeling of the speech DFT coefficients. The probability density function of the speech spectral amplitude is modeled with a simple parametric function, which allows a high approximation accuracy for Laplace or Gamma distributed real and imaginary parts of the speech DFT coefficients. Based on the approximation, a computationally efficient maximum a posteriori speech estimator is derived, which outperforms the Ephraim-Malah algorithm in a single channel noise reduction framework.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speech Enhancement by MAP Spectral Amplitude Estimation Using a Super-Gaussian Speech Model

This contribution presents two spectral amplitude estimators for acoustical background noise suppression based on maximum a posteriori estimation and super-Gaussian statistical modelling of the speech DFT amplitudes. The probability density function of the speech spectral amplitude is modelled with a simple parametric function, which allows a high approximation accuracy for Laplaceor Gamma-dist...

متن کامل

Noise Power Spectral Density Estimation based on Maximum a Posteriori and Generalized Gamma Distribution

Noise power spectral density (PSD) estimation is a crucial part of speech enhancement system due to its contributory effect on the quality of the noise reduced speech. A novel estimation method for color noise PSD on the basis of an assumption of generalized Gamma distribution and maximum a posteriori (MAP) criterion is proposed. In the experiment, generalized Gamma PDF which is a natural exten...

متن کامل

Supergaussian Garch Models

In this paper, we introduce supergaussian generalized autoregressive conditional heteroscedasticity (GARCH) models for speech signals in the short-time Fourier transform (STFT) domain. We address the problem of speech enhancement, and show that estimating the variances of the STFT expansion coefficients based on GARCH models yields higher speech quality than by using the decision-directed metho...

متن کامل

Proceedings of Meetings on Acoustics

Estimation of the power spectral density (PSD) of noise is crucial for retrieving speech in a noisy environment. 3 novel methods for estimating the non-white noise PSD of noisy speech based on a generalized gamma distribution and 3 criterions are proposed, which are minimum mean square error (MMSE), maximum a posteriori (MAP) and Maximum likelihood estimation (MLE). Because of the highly non-st...

متن کامل

Speech spectral modeling and enhancement based on autoregressive conditional heteroscedasticity models

In this paper, we develop and evaluate speech enhancement algorithms, which are based on supergaussian generalized autoregressive conditional heteroscedasticity (GARCH) models in the short-time Fourier transform (STFT) domain. We consider three different statistical models, two fidelity criteria, and two approaches for the estimation of the variances of the STFT coefficients. The statistical mo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003